Stochastic resolution of the LHC inverse problem 
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In this work the LHC inverse problem is quantified in the Bayesian context by clarifying the 
relation between the mapping fi'om the theory parameter space to experimental signature space and 
the inverse map. We demonstrate that, after complementing the LHC data by existing astrophysical, 
collider, and low energy measurements, a simple likelihood analysis is able to significantly reduce 
the inverse problem. The presented approach offers a robust, economical, and extendable way to 
extract theoretical parameters from the LHC, and other experimental, data. 
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I. INTRODUCTION 

The CERN Large Hadron Collider (LHC) will soon 
start colliding protons at 14 TeV center of mass energy. 
The LHC data is expected to shed light on the origin of 
mass of elementary particles, and discover precursors of 
a wider theory beyond the standard particle model (SM) . 
Whatever this new theory might be, it is expected to be 
described by a Lagrangian having several unknown pa- 
rameters. Our goal is the extraction of these theoretical 
parameters from experimental data. 

The feasibility of this goal was studied quantitatively 
within the context of the minimal supersymmetric stan- 
dard model (MSSM) in Ref. [1]. The authors found 
that a significant proportion of LHC signatures map to 
the theoretical parameter space with a large uncertainty. 
They isolated pairs of points located relatively far from 
each other in the MSSM parameter space producing in- 
distinguishable LHC signals. This implies that the LHC 
would not be able to resolve certain model parameter re- 
gions with sufficient precision. The difficulty was coined 
as the LHC inverse problem. 

Meanwhile, the task of parameter extraction has been 
attacked from another direction. Numerous authors per- 
form global fits of supersymmetric models using presently 
available ex2erimental data 0, i, i, i, i, 0, Wu, IS El, 
m, [H, HI, [iBl ■ These global analyses typically evaluate 
the likelihood of certain experimental outcomes assum- 
ing the constrained version of the MSSM as the theo- 
retical model, spanned by a few parameters. The calcu- 
lated likelihoods determine the probability distributions 
of each of the theoretical parameters through marginal- 
ization. Even though this procedure maps the signature 
space to the theoretical parameter space, so far there has 
been no explicit connection established with the inverse 



problem. 

In this work, we point out that the inverse problem 
can be rigorously quantified, and simply connected to 
the likelihood analyses within the framework of Bayesian 
probability theory. First, we quantify a map from signa- 
ture to theory space by introducing the probability that 
the theoretical parameters are restricted to certain ranges 
by a given set of experimental data. Then, we represent 
the map from theory to signature space by the likelihood 
that a certain set of data is predicted by a theory with a 
given set of parameters. Since Bayes' theorem explicitly 
connects the two probabilities, the stochastic framework 
offers us a robust way to attack the inverse problem. Uti- 
lizing this statistical formulation, we demonstrate a sim- 
ple way in which the inverse problem can be substantially 
reduced by using results from the LHC and other col- 
liders, electroweak precision experiments, rare b-decays, 
and various astrophysical observations. 



II. THE INVERSE PROBLEM IN THE 
STOCHASTIC FRAMEWORK 



In this section, we recapture the problem of theoreti- 
cal parameter extraction within the context of probabil- 
ity theory. We begin by defining a mapping from the 
experimental signature space to the theoretical parame- 
ter space. In the context of the inverse problem this is 
referred to as the inverse map. 

Given a theoretical model T and a set of experimen- 
tal data D, we introduce the probability V{P\T;D) that 
the theoretical model describes the data with the theory 
parameters set to values P. In the theory of probabili- 
ties V{P\T] D) is known as the conditional probability of 
the occurrence of P provided that T is assumed while D 
holds. According to Bayes' theorem, this probability can 
be simply calculated as 
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Here V{D\T\ P) is the likelihood that a certain set of 
data is predicted by a theoretical model with a speci- 
fied set of theory parameters. The probability V{T;P) 
gives the a-priori distribution of the parameters within 
the theory, fixed by only theoretical considerations in- 
dependently from the data. Finally, the evidence V{D) 
gives the integrated likelihood of the theory T in terms 
of the data alone. From unitarity it follows that 

V{D)^ Jv{D\T;P)V{T;P)dP, (2) 

where the integral extends over the whole parameter 
space, P = pi, ...,pn, of the theory. 

The likelihood function 7^(D|T; P) maps forward, from 
the theory space to the signature space. If the data under 
consideration are independent (as in our case) 'P{D\T; P) 
factorises as: 

V{D\T-P) = WC,{D\T-P), (3) 

i 

where the product is over i different experiments with Ci 
corresponding to the the likelihood function. Each 
term in the product is formed as a convolution 

C,{D\T- P) = ® Lu{T; P). (4) 

If the experimental data and theoretical predictions are 
normally distributed then Ci_e and Ci^t are Gaussians and 

AP|T; P) = -L- expixUD, T, P)/2). (5) 

The exponents of the likelihood functions 

xUD,T,P)/2 = id,-U{P)r/2a^ (6) 

are defined in terms of the experimental data D — 
{di ± (Ji^e} and theoretical predictions for these measur- 
ables {ti ± Ui,*}. Independent experimental and theoret- 
ical uncertainties combine into af = af^ + af^. 

In cases when the experimental data only specify a 
lower (or upper) limit, the likelihood function can be 
written in terms of the error function 

r^T-P) = lerf(^xf(^,T,P)/2). (7) 

In the stochastic context the task of theoretical pa- 
rameter extraction, the inverse mapping from signature 
space to parameter space, boils down to the determina- 
tion of the posterior probability distribution 'P{P\T; D) 
describing the probability that the parameters of a given 
theory have certain values in light of a certain set of data. 
This can be calculated by evaluating the right hand side 
of Bayes' theorem. A parameter is well determined by 
the data if the variance of the ■p(P|T; D) distribution is 
small. This means that, the inverse problem can be quan- 
tified in terms of the likelihood function, the theoretical 
prior and the experimental evidence. 



To quantify the inverse process we introduce a quantity 
called the experimental differentiating power, which ties 
the inverse map to the forward map via Bayes' theorem. 
Given two points Pi and P2 in the parameter space of a 
theory, we define the experimental differentiating power 
in terms of the ratio of the prior probabilities: 

V = l-V{Pi\T-D)/V{P2\T;D), (8) 

where we can always arrange that < V{Pi\T\D) < 
V{P2\T]D) < 1 ensuring < I? < 1. Regardless of 
the form of the theoretical prior V{T; D) and the exper- 
imental evidence V{D), the above discriminator can be 
expressed, via Bayes' theorem, as 

V = l-V{D\T;Pi)/V{D\T;P2). (9) 

According to Eq.®, if the theory at parameter point 
Pi fits the data just as well as at point P2, then the data 
has little differentiating power between the two model 
points. Conversely, V can only be large if the data sig- 
nificantly prefer P2 over Pi, that is ff P(D|r;P2) > 
'PiD\T;Pi). As formulated in Eq.®, V gives the com- 
bined differentiating power of all experiments in the prod- 
uct of Eq.(l3]). We could also calculate the differentiating 
power of a single experiment Di by including only the i*'' 
term in evaluating V{D\T; P) in Eq.Q. In the next sec- 
tion, we show that T) is, indeed, a robust discriminator in 
the case of the MSSM and it can be used to significantly 
reduce the degeneracy in the inverse map. 

III. THE INVERSE PROBLEM IN THE MSSM 

In this section, we illustrate the discrimination power 
of the Bayesian approach using the MSSM. The authors 
of Ref. found that out of 43026 randomly selected 
model points 283 pairs had indistinguishable LHC signa- 
tures. Since they were using discrete experimental data 
(such as number of jets, etc.), the T) values that they 
could have calculated are only or 1. Their indistin- 
guishable pairs represent models where the differentiat- 
ing power of the LHC data vanishes. 

Of these 283 pairs, 162 were analyzed by the authors of 
[3^ [3^ to determine the capability of the International 
Linear Collider (ILC) to resolve the inverse problem. As- 
suming a center of mass energy of 500 GeV, only one third 
(57) of the pairs were found to be distinguishable at 5 ct 
confidence level. 

We start with the 283 degenerate MSSM model pairs 
of Ref. [1] and observe that they consist of 384 unique 
model points. This implies that some of the models had 
more than one degenerate partner, indicating that there 
were larger degenerate groups of models. In the popula- 
tion of 384 models we determine the degeneracy groups, 
such that one model only belongs to just one group, ev- 
ery model within a group is degenerate with every other 
model within the group, and no model in any group is de- 
generate with a model in a different group. We then find 
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TABLE I: Observables used to reduce the inverse problem in the MSSM. Experimental results are listed under column diitai,, 
and typical uncertainties related to the MSSM calculations are under ai^f 



observable 



limit type 



di ± cTi 



lightest Higgs mass 

lighter chargino mass 

electroweak precision parameter 

muon anomalous magnetic moment 

rare b-decay branching fraction 

rare b-decay branching fraction 

average abundance of dark matter 

spin independent WIMP-proton elastic recoil 

gamma ray emission flux from the galactic center 



B{b - 



Apo 

-+ s-y) 



CSI 



lower limit 
lower limit 
central value 
central value 
central value 
upper limit 
upper limit 
upper limit 
upper limit 



91.0 - 114.4 GeV [16] 
103.5 GeV [18] 
(4.0111]) X 10"* [20] 
(29.0 ± 9.0) X 10-1" [21J 
(3.50 ± 0.17) X 10"'' [22] 
4.7 xlO"* [24] 
0.1143 [26| 
CDMS 2008 [28] 
1.0 xlO-i° [301 



3.0 GeV [17] 
1.0 GeV [19] 
negligible 
negligible 
10 % [23] 
10 % [25] 
10 % [27] 
20 % [29] 
100 % [31] 



all possible pairings of models within a particular degen- 
eracy group. This allows us to increase our population 
of model pairs from 283 to 802 degenerate pairs. 

For our 802 model pairs, we calculate the discriminator 
T) utilizing experimental data beyond the LHC. We select 
measurables from a wide range of experiments: collider 
limits on particle masses, precision electroweak observ- 
ables, rare decay branching fractions, and astrophysical 
observations. All this experimental data is available now 
as indicated in TABLE [Tl 

The theoretical model predictions, as the function of 
the MSSM soft parameters ti{P), were calculated using 
a modified version of ISATools and micrOMEGAs [23| 
software packages. (In this work, we only show results 
from micrOMEGAs.) In both cases ISAJET 7.71 [H 
was used to calculate the physical mass spectrum without 
two-loop corrections to exactly match the spectrum of 

Ref.s [itllS,!!!. 

In FIG.[T1 we show the individual discrimination power 
of V for several different observables. In each frame only 
a single observable (one term) is used in the likelihood 
functions defining V, in correspondence with TABLE [H 

The first two frames of FIG. [1] show "D for mass limits 
from colliders. According to the Higgs masses calculated 
by ISAJET 7.71 the original sample contained a few mod- 
els that did not respect the LEP Higgs mass limit. These 
were eliminated, at varying confidence level, by the term 
of the likelihood function containing mh. Similarly, the 
second frame indicates that a few model points are ex- 
cluded based on the mass limit on the lighter chargino 
from LEP. When these models are paired with others the 
discrimination power 2? is very close to 1. 

The two subsequent frames show V for electroweak 
precision variables. Frame 3 of FIG. [1] reflects on the 
weak discrimination power of Apo^ the degeneracy is 
broken by Apo in only a small fraction of model pairs 
at a high confidence level, and the discrimination power 
vanishes for most of the models. As frame 4 of the 
figure shows, the anomalous magnetic moment of the 
muon clearly discriminates between about 100 models 
but its discrimination power diminishes for about half 



of the pairs. This might be the result of biased selec- 
tion. The 384 MSSM model points were selected from a 
parameter region that can be probed by the LHC, that 
is from a region where superpartner masses are bounded 
from above. Present collider lower limits on superpartner 
masses were also enforced on the sample, so superpart- 
ner masses fall into a relatively narrow window. In this 
window, we expect moderate supersymmetric contribu- 
tions to , which accounts for a moderate discrimination 
power. 

The next two frames of FIG. [T] show the discrimination 
power of two rare b-decay branching fractions B{b sj) 
and B{Bs — > /i+p^). These tell a similar story to that 
of Aa^. Here B{Bs appears to be excellent 

discriminator mostly because the supersymmetric contri- 
bution to it varies sensitively with tan/3, the ratio of the 
two Higgs vacuum expectation values. (A similar conclu- 
sion was reached in ,34] . ) This sharp variation results in 
a clean discrimination in model pairs where one of the 
points has high and the other low tan /3. 

The discriminating power of the relic abundance of 
neutralinos, featured in frame 7, is also sharp. Although 
flh^ can differentiate between only a quarter of the pairs 
it differentiates at high confidence levels. This is be- 
cause the theoretical prediction for fl^^ tends to vary 
quickly (relative to the uncertainty in fl) over the pa- 
rameter space. 

The upper limit on the spin independent WIMP- 
proton elastic recoil cross section, shown in frame 8, is 
able to split a few models but its discrimination is not 
particularly high. This is reflecting the well known fact 
that the direct detection experiments just started to be- 
come sensitive to the MSSM parameter space. Also, the 
calculation of the direct detection cross section has a siz- 
able uncertainty softening discrimination in this case. 

The last frame of FIG. [T] shows gamma ray emission 
flux ^^'-^ from the galactic center as a promising discrim- 
inator. Even with a 100 % theoretical error, a consider- 
able number of the model pairs are discriminated with a 
high confidence. This discrimination strongly depends on 
the dark matter halo profile used in the calculation of the 
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0.5 1 0.5 1 0.5 1 
Discriminator 

FIG. 1: Discrimination power of various observables for the 
case of tlie MSSM. The number of model pairs plotted against 
the discriminator V=l- V{D\T; Pi)/P{D\T; P2). Only one 
observable (a single term) is used in the likelihood functions 
V in each frame. 



gamma ray flux (we use the popular NFW profile [35|). 
A conclusion quantitatively similar to ours was reached 
by the authors of Ref. [31] using the same profile. 

Based on the discrimination power of D for the individ- 
ual cases, we already see that the combination of the set 
of the nine selected experiments can result in potentially 
strong discrimination. This is indeed the case, when we 
calculate the combined differentiating power of all ex- 
periments using Eq.Q. Our final result can be simply 
summarized: out of the 802 model pairs 799 are distin- 
guished at higher than 99.999 % confidence level and only 
one of the model pairs cannot be resolved at better than 
65 % confidence level. In conclusion, we can state that 
as far as the examined sample is concerned the inverse 
problem is resolved by our analysis. 



Assuming that the original set of 43026 model points 
was a representative sample, we expect that a more elab- 
orate and higher statistics scan will lead to the same qual- 
itative conclusion. Even if, applying our approach to a 
higher statistics sample, one finds a higher number of de- 
generate model pairs our method allows for straightfor- 
ward inclusion of more data. This is important, because 
near future data from the Tevatron, the LHC, (in)direct 
detection of dark matter, b-physics and astrophysical ob- 
servation will be even more constraining on supersymme- 
try and provide a higher level of discrimination. 



IV. CONCLUSIONS 

In this work we quantified the LHC inverse problem 
in the context of the Bayesian probability theory. We 
constructed a robust discriminator, from the ratio of 
likelihoods of an arbitrary model pair, which powerfully 
distinguished essentially all examined MSSM parameter 
points. To achieve this, we combined data from a range 
of collider, astrophysical, and low energy observables. 

To reduce the inverse problem for a general theory, all 
available experimental information should be included. 
The stochastic analysis presented above allows us to 
achieve this. In the context of our analysis the differenti- 
ating power of all, or any individual, experiments can be 
simply quantified in terms of confidence levels. While we 
treated all experiments on the same footing, weighting 
can easily be introduced. 
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